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(54) Vector for use in the assay of HIV protease 

(57) A chromogenic assay is described for the identification and Isolation of drug-resistant HIV protease 
mutants. Coversely, the assay is useful to screen for new inhibitors of HIV protease, e.g., inhibitors not affected 
by drug-resistance of the HIV protease. This color screening assay contains a vector comprising a regulatable 
promoter which controls the transcription of two adjacent structural sequences, one sequence coding for HIV 
protease or mutant thereof, the other sequence coding for beta-galactosidase with an amino acid substrate 
insert cleavable by HIV protease. 



At least one drawing originally filed was informal and the print reproduced here is taken from a later filed formal copy. 



CD 
K) 



> 



BNSDOCID: <GB 2276621 A_l_> 



2276621 



- 1 - 

TITLE OF THE INVENTION 

COLOR SCREENING ASSAY FOR IDENTIFYING DRUG- 
RESISTANT mV PROTEASE MUTANTS AND INHIBITORS 
THEREOF 

BACKGROUND OF THE INVENTION 

A retrovirus designated human immunodeficiency virus 
(HIV) is the etiological agent of the complex disease that includes 
progressive destruction of the immune system (acquired immune 
deficiency syndrome; AIDS) and degeneration of the central and 
peripheral nervous system. This virus was previously known as LAV, 
HTLV-in, or ARV. A common feature of retrovirus replication is the 
extensive post-translational processing of precursor polyproteins by a 
virally encoded protease to generate mature viral proteins required for 
vims assembly and function. Inhibition of this processing prevents the 
production of normally infectious viras. For example. Kohl, N.E., et. 
al., Proc. Natl. Acad. Sci. USA, £5., 4686 (1988), demonstrated that 
genetic inactivation of the HTV encoded protease resulted in the 
production of immature, non-infectious virus particles. These results 
suggest that inhibition of the HIV protease represents a viable method 
for the treatment of AIDS and the prevention or treatment of infection 
by HIV. 

Nucleotide sequencing of HTV shows the presence of a pol 
gene in one open reading frame piatner, L. £tal.. Nature, 313 . 277 
(1985)]. Amino acid sequence homology provides evidence that the pol 
sequence encodes reverse transcriptase, an endonuclease and an HTV 
protease [Toh, H. stal., EMBO J. ^ 1267 (1985); Power, M.D. stal-. 
Science, 221, 1567 (1986); Pearl, L.H. et 21-, Nature 129, 351 (1987)]. 
Applicants construct a vector and expression system for HIV protease. 
Related art includes Baum, E.Z. £l fii-, Proc. Natl. Acad. Sci. S2, 10023 
(1990). 

The particular advantages of the present invention include 
the coordinate expression of functional HTV protease and a reporter 
beta-galactosidase having an insert cleavable by the HTV protease. The 
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coordinate expression results from the transcription of a single 
dicistronic mRNA. Control over the expression of enzyme (HTV 
protease) and substrate (the cleavable insert) is readily achieved wifli 
this type of recombinant construction. 

Further, the present invention is directed to a rapid method 
of identifying drug-resistant HTV protease mutants. Because only the 
Hindm site on the 5' side of the P-galactosidase gene is reconstructed in 
the cloning, the HTV protease gene is flanked by unique Ndel and 
Hindm sites, enabling easy removal and insertion of alternate protease 
genes. Thus, libraries of mutagenized protease genes for screening are 
constructed and inserted into this vector as Ndel-HindDI fragments. 

Finally, because the promoter controlling the coordinate 
expression is itself regulatable, manipulation of tiie internal 
concentration of HTV protease is achieved. Hiis arrangement avoids the 
toxic effects of intracellular HTV protease. Applicants induce the 
regulatable promoter, here the tryptophan promoter, only when further 
growth of the host cell E.coli is no longer needed. 

BRIEF DESCRIPTION OF THE INVENTION 

A chromogenic assay is described for the identification and 
isolation of drug-resistant HIV protease mutants. Conversely, the assay 
is useful to screen for new inhibitors of HTV protease, e.g., inhibitors 
not affected by drug-resistance of the HTV protease. This color 
screening assay contains a vector comprising a regulatable promoter 
which controls the transcription of two adjacent structural sequences, 
one sequence coding for HIV protease or mutant thereof, the other 
sequence coding for beta-galactosidase with an amino acid substrate 
insert cleavable by HTV protease. A library of HTV proteases is also 
described and is isolated in tiie form of a collection of such vectors, 
which is a color screen vector .library. 
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BRTEF DESCRTPinON OF THE DRAWINGS 

Figure 1: Map of pPrBGl, an example of the color screen 
vector of the present invention. 

DETAILED DHSCRTPTJON OF THE INVENTION 

Applicants have constructed a color screen vector useful 
for assaying HIV protease inhibitors, for constructing a library of 
expressed HTV protease mutants, and for detectmg and isolating mutants 
of HTV protease that are resistant to HIV protease inhibitors. The 
vector contains two stmctural sequences that are coordinately 
transcribed into a dicistronic mRNA from the same promoter. The first 
sequence to be transcribed is HIV protease, the second sequence is a 
reporter protein that contains a substrate site cleavable by the HIV 
protease. 

Since the vector contains a reporter gene which is assayed 
by the appearance of a chromogenic substrate, it is a color screen 
vector. In this invention, beta-galactosidase is inserted into the vector. 
This enzyme acts on 5-bromo-4-chloro-3-indolyl-beta-D-galacto- 
pyranoside to produce a blue product readily observeable by the eye. 

A cleavable reporter provides a measure of the amount of 
cleaving enzyme, in this case HTV protease inhibitor. Beta-galactosidase 
is constructed with an oligopeptide insert containing the substrate site 
for HIV protease. This substrate sequence is as follows: 

Glu Val Ser Phe Asn Phe Pro Gki He Thr 

( SEQ. ID. NO.: 14). The oligopeptide insert does not materially affect 
the enzymatic activity of beta-galactosidase, but once it is cleaved the 
beta-galactosidase is inactive. 

Thus, in the presence of HIV protease inhibitor, bacterial 
colonies containing the color screen vector of this invention arc blue in 
a suitable host such as E. coli. They are normally of white color due to 
the activity of the expressed HTV protease. If the vector contains a 
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drag-resistant HIV protease mutant instead of its more active original 
NY5 HTV protease, the colonies will be a lighter blue. 

The present invention is suitably embodied in pPrBGl, as 
set forth in Figure 1. The trp promoter operatively expresses or 
transcribes a dicistronic mRNA, indicated by the inner arrow. This 
mRNA or message codes for the HIV protease and, downstream, a beta- 
galactosidase having inserted thereto an oligopeptide site cleavable by 
HTV protease. 

The unique Ndel and Hindm sites at the ends of the HIV 
protease sequence provide a readily available and convenient site for 
insertion of a library of mutagenized HTV protease sequences. Thus the 
original HTV protease sequence can be mutagenized, then trimmed by 
digestion with Ndel and HindlQ. The resulting heterogeneous collection 
of sequences with unifomi ends is readily ligated with pPrBGl digested 
with Ndel and HindDI, to form a color screen library of HTV protease 
mutants. Screening the resulting library for lighter blue colonies in the 
presence of the inhibitor will yield drag-resistant HTV protease mutants. 

A.Preparation and Sequencing of DNA 

Following well known and conventional practice, coding 
sequences are prepared by ligation of other sequences, restriction 
endonuclease digestion, cloning, mutagenesis, organic syndiesis, or 
combinations thereof, in accordance with the principles and practice of 
constracting DNA sequences. For sequencing DNA, e.g., verification 
of a constract at the end of a series of steps, dideoxy DNA sequencing is 
the preferred method. Oflier DNA sequencing methods are well known. 

Many treatises on recombinant meAods have been 
published, including J. Sambrook et al.. Molecular Cloning: A 
Laboratory Manual 2d Ed. 1978; L.G. Davis et al., Basic Methods in 
Molecular Biology Elsevier 1986; F.M. Ausubel, et al. (eds.), Current 
Protocols in Molecular Biology, Wiley Interscience 1988 (looseleaf). 

Phosphoramidite chemistry in solid phase is the preferred 
method for the organic synthesis of oligodeoxynucleotides and 
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polydeoxynucleotides. Many other organic synthetic methods are 
available and are readily adapted to the particular sequences of this 
invention by a person skilled in the art. 

Amplification of DNA is a common step in the 
constructions of this invention, and is typically performed by the 
polymerase chain reaction (PCR). See, e.g., MuUins et al., U.S. Patent 
No. 4,800,159 and other published sources. TTie basic principle of PCR 
is the exponential replication of a DNA sequence by successive cycles of 
primer extension. The extension product of one primer, when 
hybridized to another primer, becomes a template for the synthesis of 
another nucleic acid molecule. The primer template complexes act as 
substrate for DNA polymerase which, in performing its replication 
function, extends the primers. The region in common with both primer 
extensions, upon denaturation, serves as template for a repeated primer 
extension. The conventional enzyme for PCR applications is the 
thermostable DNA polymerase isolated from Thermus aquaticus . or Tag 
DNA polymerase. Numerous variations in the PCR protocol exist, and 
a particular procedure of choice in any given step in the constructions 
of this invention is readily performed by a skilled artisan. 

B. Construction of HTV protease sequences, and expression vector. 

Applicants have arbitrarily selected the particular protease 
sequence of the NY5 strain of HTV-l to construct a structural sequence 
for a procaryotic expression vector. Virtually any other HIV-1 
protease sequence can effectively substitute for that of the NY5 strain, 
provided that the substituted strain is not derived from a patient treated 
with an HIV protease inhibitor. Tlie constructed sequence need not be 
the same as the original, or its complementary sequence, but instead 
may be any sequence determined by the degeneracy of flie DNA code. 
Conservative amino acid substitutions may also be employed, or other 
minor modifications, such as an amino terminal mediionine used herein. 

A ribosome binding site active in the host expression 
system is ligated to the 5' end of the HIV protease sequence, giving a 
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synthetic gene. For convenience, applicants have ligated the E.coli 
ribosome binding site, and inserted a unique Ndel site overlapping the 
initiation codon ATG. At or near the 3' end is a unique Hindm site. 

An important feature in the construction of the HIV 
protease expression vector of this invention is the unique restriction 
endonuclease sites at or near each end of the HIV protease sequence. 
This feature allows for convenient and rapid substitution of mutagenized 
HIV protease sequences for subsequent screening of drug-resistant HIV 
protease mutants. 

A large variety of hosts are now readily available for 
recombinant expression systems. A regulatable promoter is the most 
suitable for the present invention. For convenience, applicants have 
chosen to express the HIV protease under the control of the E.coli trp 
promoter. Other suitable regulatable promoters include lac, tac, recA, 
T7, XPr, or XPL- 

The synthetic gene is then ligated to appropriately 
linearized plasmid, e.g., pTRP which is digested with Clal and Hindm. 
The resulting plasmid, called pSyn7, expresses amino acids 1-99 of the 
NY5 strain of HIV-1 protease, and is preceded by the initiator 
methionine under ttie control of tiie E. coli trp promoter. 

C. The Beta-galactosidase Sequence with an Insert 
Cleavable by HIV Protease 

The reporter gene beta-galactosidase converts the 
chromogenic substrate 5-bromo-4-chloro-3-indolyl-beta-D- 
galactopyranoside (XGal) into a blue colored product. At certain sites 
in the primary sequence of beta-galactosidase, small oligopeptides can 
be inserted without inactivating the enzymatic activity. One such site is 
amino acid 79 of beta-galactosidase, see, e.g., Baum, EJZ. et al, supra. 
A vector coding for a readily assayable beta-galactosidase is selected. In 
this case, applicants choose pCHl 10 (Phamiacia), and cut it at the 
indicated site with Saul. A DNA duplex coding, when in frame, the 
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following HIV protease substrate sequence is ligated into pCHl 10 
linearized with Saul: 

Glu Val Ser Phe Asn Phe Pro Gin He Thr Leu GIu 

(SEQ. m. NO.: 14). Applicants use Oligonucleotides 1 and 2 (SEQ. ID. 
NO.:l and SEQ. ID. NO.: 2, respectively, see Example 2) for such 
insertion and ligation into the DNA sequence corresponding to position 
80 of the beta-galactosidase amino acid sequence, to afford a 
recombinant beta-gal plasmid. 

It will be understood that a variety of otiher 
oligonucleotides may be used with like effect. For example, degenerate 
DNA fragments may be employed, or DNA sequences coding for 
alternative substrate sites for HTV protease, including those readily 
determined by conservative amino acid substitution. Organic synthesis 
with a gene machine is a convenient way of making these 
oligonucleotides for insertion and Ugation. 

Suitable altemative substrate sites for HTV protease include 
the following: 

pl7/p24: Ser Gin Asn Tyr Pro He Val Gbi (SEQ. ID. NO.: 15) 
p24/X: Ala Arg Val Leu Ala Glu Ala Met (SEQ. ID. NO.: 16) 
X/p7: Ala Thr Ee Met Met Gin Arg Gly (SEQ. ID. NO.: 17) 
p7/p6: Pro Gly Asn Phe Leu Gin Ser Arg (SEQ. ID. NO.: 1 8) 
p6/PR: Ser Phe Asn Phe Pro Gin He Thr (SEQ. ID. NO.: 19) 
PR/RT: Thr Leu Asn Phe Pro He Ser Pro (SEQ. ID. NO.: 20) 
RT51/RNaseH: Ala Glu Thr Phe Tyr Val Asp Gly (SEQ. ID. NO.: 21) 
RT/IN: Arg Lys He Leu Phe Leu Asp Gly (SEQ. ID. NO.: 22) 
DEGl : Ghi He Thr Leu Tip Gin Arg Pro (SEQ. ID. NO.: 23) 
DEG2: Asp Thr Val Leu Glu Glu Met Ser (SEQ. ID. NO.: 24) 
DEG3 Asp Gin He Leu He Glu He Cys (SEQ. ID. NO.: 25) 

see, e.g., Debouk, C, AIDS Res. and Human Retroviruses & 153 (1992). 

Substrate sites typically add a two amino acid flanker on each side, e^g., 

sequences 14 and 19. 
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Further conventional steps include transformation and 
cloning in a host E.cqli cell, e.g., SCSi cells (Stratagene). 
Transformants can be picked with a positively selectable marker, in this 
case ampicillin resistance. Using radioactively labeled oligonucleotide 
probe, colonies bearing the desired insert are identified and picked for 
cloning. Proper insertion of the oligodeoxynucleotide cassette coding 
for the HIV protease substrate site (SEQ. ID. NO.:14) is verified by 
dideoxy sequencing using a primer coding for a sequence upstream or 
downstream of such insertion, e.g.. Oligonucleotide 3 (SEQ.ID.NO.: 3). 

The resulting recombinant beta-gal plasmid contains a 
modified beta-galactosidase which is a beta-galactosidase sequence with 
an insert cleavable by HTV protease. 

D.Coordinate Expression Vector, e.g., pPrBGl 

The modified beta-galactosidase is recovered from the 
recombinant beta-gal plasmid of Section C, above, by PGR amplification 
wifli oligonucleotide primers that insert a 5' ribosome binding site and 
two 3' stop codons. See, for example, oligonucleotides 4 and 5 (SEQ ID 
NO.: 4 and 5, respectively). Hie resulting PGR product is trimmed by 
appropriate restriction endonucleases, in one instance illustrated as tiie 
digestion with Bsal and HindlQ to give a PGR product which is the 
modified beta-galactosidase having HindlQ-compatible termini. 

The expression vector for HTV protease, e.g., pSyn7 in 
Section B above, is digested with an appropriate restriction 
endonuclease. For pSyn7, the endonuclease Hindm is used. Treatment 
with an alkaline phosphatase, such as calf intestinal alkaline phophatase, 
removes unwanted terminal phosphates. There follows a ligation 
reaction of (1) the modified beta-galactosidase having HindlQ- 
compatible termini, and (2) pSyn7 linearized with HindM. 

Further conventional steps include transformation and 
cloning in a host E. coli cell, e.g., SGSl cells (Stratagene). 
Transformants can be picked with a positively selectable marker, in this 
case ampicillin resistance. Using radioactively labeled oligonucleotide 
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probe, colonies bearing the desired insert are identified and picked for 
cloning. Proper insertion of the oligodeoxynucleotide cassette coding 
for tiie modified beta-galactosidase having Hindm-compatible termini is 
verified by dideoxy sequencing using the appropriate primer. 
Applicants picked oligonucleotides 6 and 7 (SEQ. ID. NO.: 6 and 7, 
respectively) for verification. 

The resulting plasmid has two Ndel sites, one at or near the 
5' temiinus of the HIV protease insert, the other near the 3' end of the 
modified beta-galactosidase. Removal of the latter Ndel site is 
necessary, so that new HIV protease genes can be substituted 
conveniently. By gapped-duplex oligonucleotide mutagenesis with the 
appropriate primer (herein oligonucleotide 8, which is SEQ. ID. NO.: 
8) the unwanted restriction site is removed. Removal is verified by 
colony hybridization with radioactively labeled oligonucleotide 8 and/or 
restriction mapping. 

The resulting plasmid is pPrBGl, as mapped in Figure 1. 
This plasmid coordinately expresses a functional HIV protease and a 
reporter beta-galactosidase having an insert cleavable by the HTV 
protease. The HTV protease sequence is flanked by unique Ndel and 
Hindm sites. 

E. DNA Libraries of Mutagenized HIV Protease Genes 

Mutagenesis in vitro of the gene for the functional HTV 
protease is readily accomplished by contacting the DNA with any one or 
more of a variety of mutagens, or by other means. The available 
methods include the following: generation of nested sets of deletion 
mutants (restriction digestion and Bal 31 treatment); linker-scanning 
mutagenesis; oligonucleotide-directed mutagenesis; the Kunkel method 
of oligonucleotide-mediated mutagenesis by selection against template 
strands containing uracil; insertion of linkers; insertion of linkers 
formed from degenerate pools of mutagenized oligonucleotides; 
treatment of double-stranded DNA with mutagens; treatment of single- 
stranded DNA with mutagens; misincorporation of nucleotides by DNA 
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polymerase; and organic synthesis and assembly of target sequences with 
mutually priming long oUgonucleotides 

The success in chemical mutagenesis depends on whether 
the target DNA is single-stranded or double-stranded, the nature of the 
chemical mutagen, its concentration, time of exposure, and the like. It 
is advantageous to have a screening system for the resulting library of 
mutants. For a general discussion of in vitro mutagenesis methods 
adaptable for the purposes of constructing the particular DNA libraries 
of the present invention, see, e.g., J. Sambrook, supra , chapter 15; and 
F.M. Ausubel, supra, chapter 8. 

One preferred metiiod of mutagenesis is contacting the 
target DNA in single-stranded form, then sequencing and/or cloning. 
For this purpose, the small icosahedral or filamentous single-stranded 
DNA bacteriophages, such as 0X174 or fl(fd. Ml 3), are well 
characterized and make ideal vectors. 

Both the positive and negative strands of the HIV protease 
gene to be mutagenized are cloned, then subjected to limited contact 
with one or more mutagens. Mutagens useful for treating single- 
stranded DNA include, but are not limited to: 

sodium bisulfite, 

nitrous acid, 

formic acid, and 

hydrazine. 

Limited contact with any of these mutagens avoids multiple nucleotide 
substitutions on each strand, and this can be readily accomplished by 
titration of the chemical mutagen, including its concentration, reaction 
time, and temperature. 

After removal of the mutagens, the single-stranded circular 
DNA is rendered double-stranded and trimmed for convenient insertion 
and expression. A universal sequencing primer and avian reverse 
transcriptase are preferred for polymerization. Double-stranded 
products are digested with Ndel and Hindm, and purified by agarose 
gel electrophoresis. Ligation into linearized pPrBGl or other 
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appropriate coordinate expression vector creates a color screen vector 
library of mutagenized HIV protease genes suitable for screening. 

F. Color Screening Assay 

The principle of the procedure is to replica plate a master 
plate of transformed colonies, then treat the replica plate with promoter 
inducer, HIV protease inhibitor and chromogenic substrate. The lighter 
colored colonies from the plate treated with inhibitor and substrate are 
identified. Keying back from this treated plate to the master plate (or 
other untreated plate) localizes the desired colony, i.e., a transformant 
that expresses a drug-resistant HIV protease. 

Typically, a master plate is prepared by merely plating out 
transformants of a suitable recipient strain bearing the color screen 
vector library of mutagenized HIV protease genes. For most 
applications, the master plate is an agar dish containing growth media 
suitable for flie recipient strain. In some instances, the master plate may 
be a nitrocellulose disk which itself is a replica of a pattern of colonies 
on an agar dish, or, instead, it may be a replica of a pattern of colonies 
on another nitrocellulose disk. 

A suitable recipient is readily picked for tiie color screen 
vector library. For convenience, applicants have picked E. coli K-12 
strain LS743. An essential charateristic of a suitable recipient and the 
color screen vector library is a system for positively selecting only 
transformed clones, e.g., applicants eliminate untransfonned colonies by 
incubating in the presence of an antibiotic such as ampicillin. An 
additional advantageous characteristic is the facilitative uptake of 
hydrophobic drag entities (such as HIV protease inhibitor), here 
embodied in envA> . It will be understood that selecting and obtaining 
an appropriate recipient cell strain is witiiin the skill of tiie art. 

The master plate having been prepared, it is replica plated 
onto a membrane filter or other matrix to give a colony lift. 
Nitrocellulose is the preferred membrane filter, but nylon is a feasible 
substitute. A large number of master plates with a corresponding 
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number of colony lifts may be prepared, depending on the size of the 
color screen vector library and the desired amount of screening. 

The resulting colony lifts are treated with a selected HIV 
protease inhibitor, e.g., L-689,502, the structure and synthesis of which 
is disclosed in Thompson, WJ. et al. . J.Med. Chem. 25, 1685 (1992). A 
wide variety of other HIV protease inhibitors are useful in this assay. 
There are numerous patent and literature publications disclosing the 
synthesis and characterization of other inhibitors of HIV protease. The 
reaction conditions suitable for a given inhibitor in flie treatment of the 
colony lifts are readily determined by a skilled artisan. 

Once the colony lifts are treated with inhibitor, tiiey are 
developed by inducing the expression of the reporter enzyme in the 
presence of a chromogenic substrate, to give induced colony lifts. To 
do this, treated colony lifts are simply transferred physically to 
induction plates. For convenience, applicants prepare die induction 
plates as a solid agar containing growth media, with inducing agent 
(beta-rndoleacrylic acid), antibiotic (ampicillin to maintain plasmid 
selection), chromogenic substrate (5-bromo-4-chloro-3-indolyl-beta-D- 
galactopyranoside) and HIV protease inhibitor. The appropriate choice 
of inducing agent, antibiotic, and chromogenic substrate will depend on 
the vector library construction, tiie promoter for coordinate expression, 
and the reporter; and their selection is within tiie skill of the art. 

The induced colony lifts show a pattern of blue colored 
colonies, with a few lighter blue. The location and plate number of the 
the lighter blue colonies is noted and used to refer back to the master 
plate to pick the same colony. Any colonies exhibiting a lighter color 
than wild type colonies are recovered from the master plate and 
regrown for DNA sequencing, repeated color screening, in vitro 
expression and characterization of dmg resistance in the purified 
enzyme. 

Conversely, the color screening assay is useful for 
screening new inhibitors of HIV protease. Colonies of a single cloned 
vector are grown, treated with a series of potentially inhibitory 
compounds to be assayed, then induced in the presence of chromogenic 
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substrate. Blue colonies reveal the presence of an effective inhibitor, 
and such colonies are readily identified against white colonies containing 
active HTV protease without effective inhibitor. 

EXAMPLE 1 

Construction of pSyn7, a Vector for the Expression of HTV Protease 
Sequences . 

I. A synthetic gene coding for protease from the NY5 strain of HIV-1 is 
assembled from six oligonucleotides ranging in length from 105 to 
125 bases. The gene contains, from the 5'-end, a Clal site, 33 base 
pairs containing an E. coli ribosome binding site, a unique Ndel site 
overlapping the translational initiation codon (ATG), 297 base pairs 
encoding the 99 amino acids of the protease, a translational 
temiination codon (TAA) and a Hindin site. This sequence (shown 
after digestion with Clal and HindlU) is as follows: 



20 



25 



30 



10 


20 


30 


40 


50 


CGATAATGTA 


TGGATTAAAT 


AAGGAGGAAT 


AAGACATATG 


CCTCAGATCA 


60 


70 


80 


90 


100 


CTCTGTGGCA 


GCGGCCGCTG 


GTTACTATCA 


AAATCGGTGG 


CCAGCTGAAA 


110 


120 


130 


140 


150 


GAAGCTCTTC 


TAGACACTGG 


TGCTGACGAC 


ACTGTTCTCG 


AGGAAATGAA 


160 


170 


180 


190 


200 


CCTGCCCGGG 


CGTTGGAAAC 


CTTIAAATGAT 


CGGTGGTATC 


GGTGGTTTCA 


210 


220 


230 


240 


250 


TCAAAGTTCG 


TCAGTATGAT 


CAGATCCTGA 


TCGAGATCTG 


CGGTCATAAA 


260 


270 


280 


290 


300 


GCTATCGGTA 


CCGTTCTGGT 


TGGTCCTACT 


CCTGTTAACA 


TCATCGGTCG 


310 


320 


330 


339 




TAACCTGCTG 


ACCCAGATCG 


GCTGCACTCT 


GAACTTCTA 


(SEQ ID. NO 
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Oligonucleotides are synthesized by the solid phase method 
on a DNA syndiesizer using phosphoramidite chemistry, purified by 
electrophoresis through a 12% denaturing polyacrylamide gel and 
visualized by UV shadowing. Following excision of the bands 
containing the fiill length products, oligonucleotides are recovered from 
the aciylamide by soaking and desalted by dialysis against water. 

Oligonucleotides are phosphorylated with polynucleotide 
kinase and complementary fragments are annealed and ligated in two 
consecutive reactions to ClayHindm-digested pTRP ODarke,P. al., J. 
Biol. Chem. 264 . 2307 (1989)) using conventional procedures 
(Sambrook J. et al.. Molecular Cloning: A Laboratory Manual, 2nd Ed. 
Cold Spring Harbor 1989). The sequence of the entire synthetic gene 
was confirmed by dideoxy sequencing. The resulting plasmid, called 
pSyn7, expresses amino acids 1-99 of the protease preceded only by die 
initiator Met under the control of the E. coli trp promoter. 

EXAMPLE 2 

Insertion of an HIV Protease-cleavable site into the E. coli p- 
galactosidase gene , 

A. Plasmid pCHllO (from Pharmacia) is linearized with Saul. The 
reaction is extracted with phenol and chloroform, then ethanol- 
precipitated. A duplex of the following synthetic oligodeoxyribo- 
nucleotides is ligated into this unique site. 

Oligo 1: 5 • -TGAGGTGAGCTTTAACTTCCCTCAGATCACTCT-3 • (SEQ. ID- NO.: 1) 
Oligo 2: 5 ' -TCAAGAGTGATCTGAGGGAAGTTAAAGCTCACC-3 • (SEQ. ID. NO.: 2) 

B. Competent E. coli SCSI cells (Stratagene) are transformed with this 
ligation mix, and transfonnants are selected on LB agar containing 
100 M-g/ml ampicillin. Colonies bearing the desired insert are 
identified by colony hybridization with 32p.iabeled Oligo-nucleotide 
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1 as probe, in 6X SSC/5X Denhardt's Solutioii/0.1% SDS at 65'C, 
followed by washing in 6X SSC at 65'C. Radioactive colonies are 
identified by autoradiography using X-ray film. 

C. The desired recombinant plasmid is recovered from a hybridizing 
colony by growth overnight in LB broth containing 100 ng/ml 
ampicillin, and purified by alkaline lysis and ethidium bromide-CsCl 
centrifiigation. Proper insertion of the oligonucleotide cassette is 
verified by dideoxy DNA sequencing using the following 
oligonucleotide primer: 

Oligo 3: 5 ' -GCTTTGCCTGGTTTCCG-3 ' (SEQ. ID. NO.: 3) 

FXAMPLE 3 

Cloning of cleavable P-galactosidase gene into HIV protease expression 
vector pSynV to give pPrBGl, a screening vector for drug-resistant HIV 
protease mutants 

A. The modified p-galactosidase gene is recovered from the plasmid of 
Example 2 by polymerase chain reaction amplification using the 
following primers: 

Oligo 4: 5 ■ -CGATCAAGCTTAAGCCGTAGATAAACAGGC-3 ' (SEQ. ID. NO.: 4) 

Oligo 5: 5 ' -ATCCTGGGTCTCGAGCTATTATTTTTGACACCAGACCAACTG-3 • 
(SEQ. ID. NO: 5) 

The PGR amplification reaction is carried out for 25 cycles 

as follows: 

1 min at 94°C; 

2 min at 37°C; 

3 min at 72 C 

This procedure inserts a . ribosome binding site 5' of the p- 
galactosidase gene and two stop codons at its 3' end. 
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B. Hindin-compatible termini are generated in the PGR product by 
digestion with Bsal and HindHI. 

C. Plasmid pSyn7 is digested with Hindm, then treated with calf 
intestinal alkaline phosphatase. A quantity of 0.05 |ig of the digested 
PGR product of Step B are ligated to 0.1 |ig HindlQ-cut pSyn7 DNA 
in a final volume of 10 using T4 DNA ligase, for 3 hr at 16'G. 

D. Gompetent E, coli SGSl (Stratagene) are transformed with this 
ligation mix, and transformants are selected on LB agar containing 
100 |xg/ml ampicillin. The desired recombinants are identified by 
colony hybridization with 32p.oligo 1 as described in Example 2, 
Part B. 

E. Plasmids are recovered from hybridizing colonies by growth 
overnight in LB broth containing 100 |xg/ml ampicillin, and purified 
by alkaline lysis and ethidium bromide-GsGl centrifugation. 

F. Proper insertion of the insert is verified by dideoxy DNA sequencing 
using the following oligonucleotide primers: 

Oligo 6: 5 ' -TTTTCGCTCATGTGAAGT-3 ■ (SEQ. ID. NO.: 6) 
Oligo 7: 5 ' -TGCGTTCTGATTTAATCTG-3 ' (SEQ. ID. NO.: 7) 

G. Using the gapped-duplex oligonucleotide mutagenesis method 
(Golonno, R.gt al., Proc. Natl. Acad. Sci. £5, 5449 (1988)), the Ndel 
site within the P-galactosidase gene is eliminated. The mutagenic 
oligonucleotide and hybridization probe for this site removal is the 
following: 

Oligo 8: 5 ' -GTTTCCACATGGGGATT-3 ' (SEQ. ID. NO.: 8) 
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Removal of the Ndel site is verified by colony 
hybridization with 32p-oligo 8 and by loss of the site in restriction 
mapping using Ndel. 

H. The resultant plasmid, pPrBGl, is selected by growth on LB agar 
containing 100 Hg/ml ampicillin, grown in liquid culture and 
purified by alkaline lysis and ethidium bromide-CsCl centrifugation 
as described above. 

I. In pPrBGl, the cleavable p-galactosidase gene and its ribosome 
binding site are inserted immediately 3' of the expressed HIV 
protease gene, under the control of the tryptophan promoter of 
pSyn7. A diagram of this plasmid is shown in Figure 1. The HIV 
protease and its cleavable p-galactosidase reporter are therefore 
coordinately expressed from a single dicistronic mRNA. Because 
only the Hindffl site on the 5' side of the P-galactosidase gene is 
reconstructed in the cloning, the HIV protease gene is flanked by 
unique Ndel and Hindffl sites, enabling easy removal and insertion 
of alternate protease genes. Thus, libraries of mutagenized protease 
genes for screening are constructed and inserted into this vector as 
Ndel-Hindm fragments. 

EXAMPLE 4 

25 In Vitro Mutagenesis of the HIV-1 Protease Gene and Construction of 
Mutant Libraries in pPrBGl 

I. The synthetic protease gene from plasmid pSyn? is recovered from 
the plasmid by PGR amplification using the following 
30 oligonucleotide primers: 

Oligo 9: 5 ' -AGGAGGAATTCGACATATGCCTCAGATCAC-3 ' (SEQ. ID. NO.: 9) 
Oligo 10: 5'-CAGCCAAGCTTAGAAGTTCAGAGTGCAGCC-3' (SEQ. ID. NO.: 10) 

Amplification is carried out as described in Example 3, 
Step A. This amplification adds an EcoRI site to the 5' end of the 
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protease gene» peraiitting subsequent cloning. The amplified product is 
digested with EcoRI and Hindm, purified by agarose gel 
electrophoresis, and cloned into the EcoRI and Hindm sites of 
phagemids pGEM-7ZF(-) (Promega), to yield pGEM-Pr (+) and 
pGEM-Pr (-), respectively. 

n. Single-stranded DNA of phagemids pGEM-Pr (+) and pGEM-Pr (-) 
is prepared by superinfection with phage M13K07 (Promega). 

m. The purified single-stranded DNAs are subjected to mutagenesis in 
vitro wifli nitrous acid, hydrazine, or formic acid as previously 
described (Myers gt 2I., Science 229 . 242 (1985)), made double 
stranded with AMV reverse transcriptase and the primers: 

Oligo 11; 5 ' -TAATACGACTCACTATA-3 ' , f or pGEM-Pr (+) , 
(SEQ. ID. NO.s 11) or 

Oligo 12: 5 ' -ATTTAGGTGACACTATA-3 ' , for pGEM-Pr(-) (SEQ. ID. NO.: 12) 

IV. The double stranded products of the pGEM-Pr (+) and pGEM-Pr 

(-) reactions are pooled and digested with Ndel and Hindm. The 
mutagenized protease genes are recovered by agarose gel 
purification and ligated into the gel-purified Ndel/Hindm digested 
vector fragment of pPrBGl, to generate a randomly-mutagenized 
library of HIV protease genes in the screening vector. 

V. E. coJi K-12 strain LS743 (M^ ^ palK. lacAU169. envA. TnlO . 
rpsL), was used as the recipient strain for the transformation. 
Competent LS743 cells are prepared as previously described 
(Hanahan D., J. Mol. Biol. 166 . 557 (1983)) and transfomied by the 
mutagenized protease library, plating on LB agar containing 100 
\igfrnl ampicillin at 37°C overnight. 
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EXAMPLE 5 

Screening HTV-l Protease Mutant Libraries for Drug-Resistant Mutants 

5 I. Transformants are generated in LS743 and colonies are lifted onto 
82 mm diameter BA85 nitrocelluose disks (Schleicher & Schuell). 
The disks are removed from the plates and placed, colony-side-up, 
on 1 ml puddles consisting of: 860 ^1 1 M Tris-Cl pH 7.4/0.15 M 
NaCl, 138 \i\ dimethylsulfoxide (DMSO), and 2 ^1 of a 10 mM stock 

10 of protease inhibitor L-689,502 (Thompson, WJ. st ai., J. Med. 

Chem. 21> 1685 (1992)) in DMSO. After incubating 25 minutes, the 
filters are transferred to induction plates. 

The plates are prepared as follows: 

15 

A. To prepare medium,into a volume of 254 ml of H2O is dissolved 1.8 
g Na2HP04, 0.9 g KH2PO4, 0.15 g NaCl. 0.3 g NH4CI. and 0.6 g 
Difco casamino acids. The pH is brought to 7.4 with a few drops of 
ION NaOH. A quantity of 4.5 g of Bacto agar is added, and the 

20 media is autoclaved 20 minutes, then cooled to 55'C. The following 
are then added: 3 ml of 20% (w/v) glucose, 0.6 ml of 1 M MgS04, 
30 jil of 1 M CaCl2, 300 ^1 of 100 mg/ml ampicillin, 300 ^1 of 20 
mg/ml p-indoleaciylic acid (in 100% ethanol), 300 ^1 of 40 mg/ml 
XGal (5-bromo-4-chloro-3-indolyl-P-D-galactopyranoside 

25 (Boehringer-Mannheim)) in dimethylfonnamide, and 42 ml of 

DMSO. A volume of 20 ml of molten medium containing 78.4 ^il of 
a 5.10 mM stock of drug is poured per plate. The medium is 
allowed to solidify, and the plates are used immediately for the assay. 

30 A. The original master plates from the transformation are re-incubated 
at 37*0 until colonies are visible. These are stored at 4°C as sources 
of viable cells for mutant characterizations. 
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n. The filters are incubated on the induction plates for 24 hours at 
37°C. The resulting color of drug resistant colonies is a significantly 
lighter blue than drug sensitive (wild type) colonies. 

in. Any colonies exhibiting a lighter color than wild type colonies are 
recovered from the master plate and regrown for DNA sequencing, 
repeated color screening, and in vitro expression and 
characterization of drug resistance in the purified enzyme. 

While the foregoing specification teaches the principles of 
tfie present invention, with examples provided for the purpose of 
illustration, it will be understood that the practice of the invention 
encompasses all of the usual variations, adaptations, modifications, or 
deletions as come \yithin the scope of the following claims and its 
equivalents. 
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SEQUENCE LISTING 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
TGAGGTGAGC TTTAACTTCC CTCAGATCAC TCT 33 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
TCAAGAGTGA TCTGAGGGAA GTTAAAGCTC ACC 
(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GCTTTGCCTG GTTTCCG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
CGATCAAGCT TAAGCCGTAG ATAAACAGGC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATCCTGGGTC TGGAGCTATT ATTTTTGACA CCAGACCAAC TG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICTO.: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTTTCGCTCA TGTGAAGT 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TGCGTTCTGA TTTAATCTG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Probe 

(iii) HYPOTHETICTOJ: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTTTCCACAT GGGGATT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGGAGGAATT CGACATATGC CTCAGATCAC 30 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAGCCAAGCT TAGAAGTTCA GAGTGCAGCC 30 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Primer 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAATACGACT CACTATA 17 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA Probe 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATTTAGGTGA CACTATA "^"^ 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Coding Sequence 

(iii) HYPOTHETICAL: NO 

(vi) IMMEDIATE SOURCE: Protease Gene From NY5 Strain of HIV-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGATAATGTA TGGATTAAAT AAGGAGGAAT AAGACATATG CCTCAGATCA 50 

CTCTGTGGCA GCGGCCGCTG GTTACTATCA AAATCGGTGG CCAGCTGAAA 100 

GAAGCTCTTC TAGACACTGG TGCTGACGAC ACTGTTCTCG AGGAAATGAA 150 

CCTGCCCGGG CGTTGGAAAC CTAAAATGAT CGGTGGTATC GGTGGTTTCA 200 

TCAAAGTTCG TCAGTATGAT CAGATCCTGA TCGAGATCTG CGGTCATAAA 250 

GCTATCGGTA CCGTTCTGGT TGGTCCTACT CCTGTTAACA TCATCGGTCG 3 00 

TAACCTGCTG ACCCAGATCG GCTGCACTCT GAACTTCTA 339 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: eunino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Glu Val Ser Phe Asn Phe Pro Gin lie Thr L u Glu 

(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
Ser Gin Asn Tyr Pro lie Val Gin 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 eunino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Ala Arg Val Leu Ala Glu Ala Met 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
Ala Thr lie Met Met Gin Arg Gly 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Pro Gly Asn Phe Leu Gin Ser Arg 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Ser Phe Asn Phe Pro Gin lie Thr 

(2) INFORMATION FOR SEQ ID NO:20t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: eunino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
Thr Leu Asn Phe Pro lie Ser Pro 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECIJLE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FB^GMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Ala Glu Thr Phe Tyr Val Asp Gly 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 eunino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
Arg Lys lie Leu Phe Leu Asp Gly 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: aunino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
Gin lie Thr Leu Trp Gin Arg Pro 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Asp Thr Val Leu Glu Glu Met Ser 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
Asp Gin lie Leu lie Glu lie Cys 
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WHAT IS CLAIMED IS : 

1 . The plasmid pPrBGl . 

2. The plasmid pPrBGl, inserted with a mutated 

protease. 

3. A color screen assay for drug-resistant HIV protease 
mutants, comprising the steps of: 

(a) plating out an E. coli recipient strain transformed with a library of 
mutagenized, full-length HIV protease sequences in pPrBGl, to give 
colonies on a master plate; 

(b) lifting said colonies onto nitrocellulose filters, to give colony Ufts; 

(c) treating the colony lifts with one or more inhibitors of HTV 
protease, to give treated colony lifts; 

(d) incubating the treated colony lifts in induction medium, said 
induction medium comprising suitable growth media and a color- 
producing substrate for beta-galactosidase, to give induced colony lifts; 
and 

(e) selecting those colonies from the master plate that correspond to the 
lighter colored colonies on the induced colony lifts; 

to give dmg-resistant HIV protease mutants. 

4. A color screen vector library of HIV protease 
mutants, said library comprising a mutagenized library of full-length 
HIV protease sequences in the color screen vector pPrBGl, said library 
prepared by the method comprising the steps of: 

(a) providing a quantity of single-stranded plasmid witii one fiill-length 
HIV protease sequence inserted thereto; 

(b) contacting said plasmid with one or more mutagens; 

(c) removing die mutagens, yielding mutagenized plasmid; 

(d) rendering the mutagenized HTV protease sequence double-stranded 
by primer extension of the mutagenized plasmid, yielding a double- 
stranded mutagenized HIV protease sequence; 
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(e) trinuning the ends of said double-stranded mutagenized HTV 
protease sequence with restriction endonucleases, yielding trimmed 
double-stranded sequences; 

(f) ligating the trimmed double-stranded sequences of step (e) into 
pPrBGl linearized by restriction endonuclease digestion between the tip 
promoter and the structural sequence for the cleavable reporter beta- 

galactosidase, 

to give a color screen vector library of HIV protease 

mutants. 
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